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Abstract 

Data transfer in opportunistic Delay Tolerant Networks (DTNs) must rely on unscheduled sporadic meetings 
between nodes. The main challenge in these networks is to develop a mechanism based on which nodes can learn to 
make nearly optimal forwarding decision rules despite having no a-priori knowledge of the network topology. The 
forwarding mechanism should ideally result in a high delivery probability, low average latency and efficient usage of 
the network resources. In this paper, we propose both centralized and decentralized single-copy message forwarding 
algorithms that, under relatively strong assumptions about the networks behaviour, minimize the expected latencies 
from any node in the network to a particular destination. After proving the optimality of our proposed algorithms, we 
develop a decentralized algorithm that involves a recursive maximum likelihood procedure to estimate the meeting 
rates. We confirm the improvement that our proposed algorithms make in the system performance through numerical 
simulations on datasets from synthetic and real-world opportunistic networks. 

Index Terms 

Delay Tolerant Networks, Opportunistic Forwarding, Meeting Rate Estimation 


I. Introduction 


D elay or Disruption Tolerant Networks (DTNs) are a class of wireless mobile node networks in which the 
communication path between any pair of nodes is frequently unavailable. Nodes are thus only intermittently 
connected. DTNs were first studied in the 1990s when the research community considered how the Internet could 
be adapted for space communications Q. Later, it was recognized that DTNs were a suitable model for several 
terrestrial networks. 


The authors are with the Computer Networks Research Laboratory, Department of Electrical and Computer Engineering, McGill University, 
Montreal, QC H3A 0E9, Canada (e-mail: shohreh.shaghaghian@mail.mcgill.ca; mark.coates@mcgill.ca). This research was supported by the 
Natural Sciences and Engineering Research Council of Canada (NSERC) via grant application 261528. 

Manuscript received April 3, 2015 


2 


DTNs can be categorized according to whether the node connections are scheduled (thus predictable) or random 
(hence unpredictable). Space communication networks fall into the first group. Networks belonging to the second 
category, which are the focus of this paper, are also referred to as opportunistic networks, because nodes seize 
the opportunity to transfer data when a communication channel becomes available. Opportunistic networks have 
been studied intensively in recent years (e.g., Q-Q) because they can fulfil a number of useful purposes, such 
as non-intrusive wildlife tracking (e.g., ZebraNet Q and SWIM Q), emergency response in disaster scenarios 
(e.g., ChaosFIRE Q), provision of data communication to remote and rural areas (e.g., DakNet 0), and traffic 
offloading in cellular networks (e.g., Q). 

In most opportunistic networks, the nodes are highly mobile and have a short radio range, and the density of 
nodes is low. In many cases, nodes have limited power and memory resources. These attributes combine with 
the intermittent connections to make routing traffic challenging. Routing is usually based on a store-carry-forward 
mechanism that exploits node mobility. In this mechanism, the source transmits its message to a node it meets. 
This intermediate node stores and then carries the received message until it meets another node to which it can 
forward the message. This process is repeated until the message reaches its destination. The key ingredients in 
designing an opportunistic network routing protocol are the forwarding decisions: should a node forward a message 
to a neighbour it meets? should it retain a copy for itself? 

Although much research effort has been devoted to the development of opportunistic network routing algo¬ 
rithms iHl-ilig, the algorithms are either centralized, have no performance guarantees, or ignore the need to 
estimate network parameters. Our work focuses on the mobile ad-hoc network (MANET) setting, where node speed 
is much reduced compared to the vehicular ad-hoc (VANET) case, and we can assume that there are fewer restrictions 
on the amount of data that nodes can transfer when they meet. In this paper, we derive a decentralized routing 
algorithm that has performance guarantees (under simplifying assumptions about the network behaviour). When the 
meeting times between nodes are independent and exponentially distributed, the routing algorithm minimizes the 
expected latency in sending a packet from any source node to a specific destination. We examine the behaviour of 
the routing algorithm when the meeting rates are learned online using a recursive maximum likelihood procedure. 
We show that, for a stationary network, the decision rules and achieved expected latencies converge to those 
obtained when there is exact knowledge of the meeting rates. We present the results of simulations that compare 
the performance of the proposed algorithm to previous approaches, and examine how the algorithm is affected by 
practical network limitations (finite buffers, restrictions on data exchange, message expiry times). 


Organization: The paper is organized as follows. In the following subsection, we discuss related work. In 
Section we describe our system model and formulate the routing problem. In Section III we present the 
forwarding algorithms and discuss their optimality under the network modeling assumptions. We present numerical 
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simulation results in Section IV and make concluding remarks in Section [V] 


A. Related Work 

The first proposed approaches for routing in opportunistic networks were based on extending the concept of 
flooding to intermittently connected mobile networks. In these replication-based methods, a node forwards the 
messages stored in its buffer to all of (or to a fraction of) the nodes it encounters. There is no attempt to evaluate 
the capability of a given node to expedite the delivery. These routing algorithms have few parameters: they determine 
only how much replication can occur and which nodes can make copies of packets. One of the earliest algorithms 


was epidemic routing \ 101, in which a node forwards a message to any node it meets, provided that node has not 


previously received a copy of the message. Thus messages are quickly distributed through the connected portions of 
the network. Other replication-based approaches (pT|, fT^ , [281, | [29| ) manage to reduce the transmission overhead 
of epidemic routing and improve its delivery performance through modification of the replication process and 
prioritization of messages. Models have been developed that allow an analytical characterization of the performance 
of the epidemic routing techniques 1301, m- Replication-based approaches result in a high probability of message 
delivery since more nodes have a copy of each message, but they can produce network congestion. 

A step towards achieving more efficient routing approaches is to consider the history of node contacts in the 
network instead of blindly forwarding packets. History-based (also called utility-based) routing algorithms assume 
that nodes’ movement patterns are not completely random and that future contacts depend on the frequency and 
duration of past encounters. Based on these past observations, both the source and the intermediate nodes decide 
whether to forward a message to nodes they encounter or to store it and wait for a better opportunity. An early 


example is |13|, which extends epidemic routing to situations with limited resources, incorporating a dropping 
strategy for the case when the buffer of a node is full. The dropping decisions are based on the meeting history of 


the node. PRoPHET 1141 assigns a delivery probability metric to each node which indicates how likely it is that the 
message will be delivered to the destination by that particular node. This metric is updated each time two nodes 


meet, and thus takes into account the history of meetings in the network. MaxProp 115| and MEED |16| are other 


examples of history-based algorithms proposed for vehicular DTNs. In these networks, nodes move with higher 
speeds, reducing the amount of time they are in each other’s radio range. Hence, the two main limiting resources 
are the duration of time that nodes are able to transfer data and their storage capacities. 

Other researchers have examined whether it is possible to exploit other characteristics of opportunistic networks 
to improve the performance of routing algorithms. Since social interactions often determine when connections 
between nodes occur, several algorithms strive to use social network concepts like betweenness centralities (e.g. 
in))- or community formations (e.g. |[T8|-p0|). Other algorithms have attempted to take advantage of the strategic 
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behaviour of nodes (e.g. |2TJ, 0 ). Our work focuses on routing a message to a single destination, but there 
are connections to research that addresses the task of spreading information to multiple nodes in a network. Of 
particular interest is the gossip-based approach in which greatly reduces the number of message copies in the 
network while achieving near-optimal dissemination. 

The experimental-based studies demonstrate the efficiency of their proposed methods by running simulations on 
traces recorded from real world opportunistic networks. Experimental analyses are valuable and take into account 
practical considerations, but they can leave us with an incomplete understanding of how an algorithm operates and 
how it will perform in other untested network conditions. For example, the behaviour of PRoPHET has been shown 


to be very sensitive to parameter choice \32\. It is also useful to design an optimal algorithm under slightly less 
realistic modeling assumptions, and then consider how it can be adapted to address the practical limitations, without 
completely losing its desirable features. More recent studies have focused on deriving a forwarding process whose 


optimality (in some sense) can be mathematically proved under assumptions about network behaviour. 1241 extends 
the two hop relay strategy of pT| by considering the expected delivery time to the destination as a metric to find 
fhe best set of candidate relays. By increasing the number of relaying steps recursively, a centralized single-copy 
multi-hop opportunistic routing scheme is proposed for sparse DTNs. The main defect of a centralized approach 
is that global knowledge of the network is required in order to make forwarding decisions. 

There have been some efforts towards migrating to decentralized solutions that still provide performance guar¬ 


antees. 1251 proposes a decentralized time-sensitive algorithm called TOUR in which message priority is taken into 
account in addition to nodes’ expected latencies when making forwarding decisions. Although in TOUR each node 
only needs to be aware of the local information about the rates of contacts with its own set of neighbours, the 
algorithm assumes that the node knows the exact contact rates. In most practical scenarios, this assumption is not 
valid. 

Some researchers have explored how imprecision in the measurement or estimation of network parameters can 


impact the performance of opportunistic network routing algorithms. In |26|, 127|, Boldrini et al. discuss different 
sources of errors that may exist in parameter estimation like missed encounters, incorrect combination of short 
contacts, and memory limitations. They model these errors as a random variable with a normal distribution and 
evaluate the performance of four different forwarding schemes under this model. Although this error analysis is 
useful, Boldrini et al. do not specify how parameters should be estimated in order to obtain a performance that 
approaches what can be achieved when perfect a-priori knowledge of the network parameters is available. 


Some of the results in this paper were presented in an earlier conference paper |33|, but here we include more 
extensive experimental analysis and additional theoretical results. 








5 


II. System Model 

We consider a network of N mobile nodes which aim to send messages to a particular destination node d. 
The set of nodes is denoted by J\f. We assume that the random inter-meeting times of nodes are independent and 
exponentially distributed with parameter Xij for nodes i and j. 

Although the aggregate intermeeting distributions of nodes in mobile ad-hoc networks often follow a truncated 
power law p4|, |j^, there is evidence that the intermeeting times of individual pairs of nodes can be adequately 


modeled by exponential distributions with heterogeneous coefficients 136|-|39|. In particular, Conan et al. |36| 


and Gao et al. |37| conduct statistical analyses of mobile social network data traces, including the Infocom data 


set 1401 that we analyze in Section IV They demonstrate that most pairs of nodes have intermeeting times that are 


approximately exponentially distributed. In |38|, |39|, approximately exponential distributions of individual meeting 
times are detected through statistical analyses of car/taxi mobility traces. 

We associate with the network a contact graph which is formed by adding a link between any two nodes that 
meet. We assume that the contact graph is connected and denote the set of neighbors of node i in this graph by Si. 
Since the contacts between nodes are not pre-scheduled, we cannot identify end-to-end paths ahead of time. Hence, 
solving the routing task is equivalent to identifying the forwarding decisions that nodes should make when meeting 
each other. We assume that nodes’ buffer sizes are unlimited, message Time To Live (TTL) is infinity and that 
nodes’ speed and message lengths are such that any number of messages can be forwarded during each meeting. 
We consider only algorithms that do not involve replication. In the class of algorithms we consider, each time node 
i meets one of its neighbors j G Si, it forwards a message destined for d with probability pij. Considering the 
matrix Patxat comprised of all pairs i and j, we set pij = 0 if nodes i and j never meet and are thus not neighbors 
in the contact graph. We denote the forwarding probabilities of node i by the vector p*; this is the i-th row of the 
matrix P. 

The expected latency from node i to destination d is a function of the probability decision matrix P and we 
denote it by Lid(P). Our goal is to find the matrix P* such that the sum of the expected latencies of all the nodes 
in the network to the specified destination d is minimized. Let us call this utility function U{P) = Yhi&M Lid{P). 
We assume that the network topologies and meeting rates are such that the solution P* is unique. If not, our 
algorithms guarantee that we reach one of the optimal matrices, but the proofs are more complicated. The first step 
towards achieving this goal and finding matrix P* is to discover how the expected latency of an arbitrary node i, 
^id(P). depends on the elements of the probability decision matrix P in general. Lemma provides an expression 
for Lid{P) in terms of P and \ij,j G Si. The proof is available in Appendix [A| 










6 


Lemma 1. The expected latency of a node i ^ N to the destination d is 


Lid{P) 


Pij\jLjd{^) 

ij 


( 1 ) 


Based on the expression derived in Lemma [TJ the expected latency of each node to the destination depends on 
the expected latencies of its neighbours. This result raises a substantial question: Does the probability decision 
matrix that minimizes the sum of expected latencies of all nodes of the network (P*), also minimize the expected 
latency of each individual node? Before continuing to propose algorithms for finding P*, we answer this question 
and make two points about the structure of P* through the following theorem. The proof is provided in Appendix 

m 


Theorem 1. Suppose P* = argminp^^ Then: 

(1) P* is a binary matrix (its components are either 0 or Ij. 

(2) For any i G N, the matrix P* also minimizes Lid(F): 


yi e Af : P* = arg min LidCP) (2) 

Theorem [T] shows that the minimization problem is actually a binary problem. Each time node i meets one of its 
neighbours j G Si, it either forwards the message or keeps it. From now on, we change our notation and use the 
binary indicator matrix B instead of P to capture this binary decision. Therefore, the optimization takes the form: 

N 

B* = arg min VLid(B) (3) 

2=1 

Theorem [T] also states that the optimum solution matrix B* can be equivalently achieved by minimizing the 
expected latency of each of the network nodes to the destination. This is the main idea of developing centralized 
and decentralized algorithms for finding B*. In the next section, we introduce the algorithms we have proposed 
for solving this optimization problem and prove that they find the optimal solution. 


III. Algorithms 

In the first part of this section, we try to to find B* in a centralized fashion where the whole topology and 
meeting rates of the network are available at a central unit. This unit calculates a binary matrix B and informs the 
nodes about the neighbours they should forward their buffered messages to. We prove that the solution achieved 
upon completion of this algorithm is the same as the optimum solution B*. In the second part of the section, we 
introduce a decentralized algorithm and prove that it converges to the same global solution with probability 1. The 
advantage of the decentralized approach is that no node needs to have a global knowledge of the network and each 
node can learn its own optimal forwarding decisions. The only piece of information a node needs to know is its 
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meeting rates with its own neighbours. Finally in the last part of the section, we make our model more realistic by 
assuming that nodes have no a-priori knowledge of any meeting rates. In this more practically realistic scenario, 
each node estimates the meeting rates with its neighbours, updating its estimates each time a contact occurs. 


Algorithm 1 Centralized Greedy Latency Minimization 


1 

2 

3 

4 

5 

6 
7 


8 

9 

10 

11 

12 

13 

14 


// Initialization 


A = {d}, B = OnxN, Ldd = 0, Ljd = oo for j / d 
// Iteratively add nodes to the set 

while ^ / AA do 

for each node i ^ A do 
Identify Si_A_ = 5^ n ^ 


Calculate Gid for m* G {0, 


Gid = minr 


i+E 




\ij L j d 




rriijXi 


Denote Vi = {j\j G Si_A,m*j = 1} 

end for 

Identify v = argminjg^/_4 
Set Lyd ^vd 
Set byj = 1 for all j G Vi 
Set ^ ^ U {u} 

end while 


A. Centralized Approach with Global Knowledge 

Suppose for each node i G M, the set of neighbours Si and their meeting rates Xij,j G Si are known at a 
central calculation unit. Algorithm [T] presents an iterative procedure to identify a binary decision matrix B. In this 
algorithm, we first decide on the forwarding rules of the node that has the most frequent direct contacts with the 
destination. We refer to this node as node A. In order to achieve the minimum expected latency to the destination, 
node A should forward its generated or received messages only to the destination, ignoring its meetings with any 
other nodes. All other nodes that A encounters meet the destination less frequently and, if they forward their 
messages to the destination through other nodes, these other nodes also meet the destination less frequently than 
A. In subsequent steps of the algorithm, we consider all the nodes that have direct contacts with the nodes whose 
forwarding decision rules have already been made (the set A) and calculate the minimum latency that each of 
them can obtain by forwarding through nodes in A to the destination. At the end of each iteration, we finalize 
the forwarding decision for the one node that can achieve the minimum latency and add it to A. We repeat the 
procedure until the decision has been made for all the nodes of the network and the elements of the binary matrix 
B have all been specified. The next theorem states that the binary matrix B resulting from applying this procedure, 
as specified concretely in Algorithm [T] achieves the minimum sum of expected latencies to the destination. The 
proof can be found in Appendix 








Theorem 2. Suppose all meeting rates are different and there exists a unique solution B* for the optimization 
problem 0). 

1) After each iteration of Algorithm^ 

a) Mi e A, Mj e Af : bij = h*- 

b) Mi£A: Lirf(B) = UffW) 

c) maxig^Lid(B*) < Lirf(B*) 

2) Upon completion, Algorithm^identifies a labelling B and associated expected latencies Lid such that B = B* 

Theorem demonstrates that the iterative optimization procedure expressed in Algorithm [1] finds the solution of 
the minimization problem in Q. If there is not a unique solution, then at some point in Algorithm [T] there will be 
multiple rrij that solve the optimization in line 7. It is straightforward to show that choosing any one of these nij 
leads to a decision matrix B that achieves the minimum expected latencies. 

B. Decentralized Approach with Partial A-priori Knowledge 

Suppose no central unit exists and each node is just aware of its own Si and the meeting rates \ij,j G Si. 
Algorithm demonstrates how nodes can make their binary forwarding decisions based on this local information. 
Since the expected latency of each node depends on the expected latency values of its neighbours, nodes need to 
have an estimation of their neighbours’ expected latencies to be able to make forwarding decisions. We denote 
by Lid{j) the estimate at node j of the latency from node i to the destination. In Algorithm]^ each time two 
nodes meet, they update these estimates and then recalculate their optimum forwarding rules. Theorem proves 
that this decentralized approach results in the same global optimum solution. The proof of Theorem is provided 
in Appendix 

Algorithm 2 Decentralized Greedy Latency Minimization 

1: // Initialization 

2: B = OatxAt ^ ^ 

3: Mi G Af/d,Mj G Af : Ldd{j) = 0, Lid{j) = oo 
4: while Nodes continue to meet do 
5: // Nodes i and j meet at time t 

6: Set Lid{i') 

1: Set Ljd{i) = Ljd{j) ^ 

T T 1 'r' / •\ ' ^ ^ y fc P iS • k^ik ^ k diff) 1*1 • r 1 ■■ ■■ * 

8: Update Lid[i) = mirij^ — m k\ k - identity the minimizing mj 

9: Update Ljdij) = mm v- and identify the minimizing m* 

J ^jh'Ajk J 

10: Set bj = m* and hj = m* 

11: end while 
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Theorem 3. The decision matrix B identified by Algorithm converges to B* with probability 1. 

We refer to our proposed decentralized greedy latency minimization algorithm (Algorithm as MinLat and 
evaluate its efficiency in different random and real-world networks based on certain performance metrics in Section 


IV Regarding the computational complexity of finding the minimum expected latency in MinLat, the following 


lemma shows that the optimizations in lines 8 and 9 of this algorithm are linear fractional programs and can he 
solved quickly using variants from linear programming. Further details are available in Appendix 

Lemma 2. The minimization problem in Algorithm 

1 “h ^^k£Si ^ik^ikLkdii') 


Lid{i) = min 

miSfO,!}'®' 


^^kcS ^ik^ik 


( 4 ) 


can be converted to a linear programming problem. 


Assuming that Q can be solved in polynomial order P(|5i|), the worst case complexity order of Algorithm [I] is 
0{N‘^)P{N) because in the zth round of this algorithm, Q should be solved for each of the N — i nodes that are 
not in the set A. In Algorithm each time node i meets one of its neighbours,it solves a problem of complexity 
PdS'il). The only information that a node needs to share when it meets another node is its estimate of its own 
expected latency to the destination. In the general case where messages can be destined to any node in the network, 
this exchangable message could be a length N vector of expected latencies to all nodes. 

The following proposition provides a bound on the expected convergence time of Algorithm The brief proof 
is provided in Appendix The bound depends on the slowest meeting rate between each node and its candidate 
relay nodes. This is a conservative bound, since in practice, a node only needs to meet the relay nodes to which it 
actually forwards data under the optimum forwarding rule. 


Proposition 1. The expected convergence time, E{Tpz), of Algorithm 


is bounded as E{T]\f) < 


1 

Aid 




1-1 


min 

^li>0 


C. Decentralized Approach with No A-priori Knowledge 


In part III-B we assumed that as soon as a node meets another node, it has a perfect knowledge of its meeting 
rate with that node. In practice, a node will need to estimate its meeting rates with the neighbours and periodically 
revise the estimation as meetings occur (or fail to occur). Consider an arbitrary pair of nodes that meet each other 
with rate A. We denote the intermeeting time, which is the time between and i + meetings, by x* > 0. 
For this specific pair of nodes, Xi is an independent sample of an exponentially distributed random variable with 
parameter A. Using the maximum likelihood (ML) approach we can estimate the parameter A after n samples. The 
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likelihood function £(A|xi, ..,x„) = X^e is maximized by 


A = 


n 


( 5 ) 


V” T- 

l^i=l 

Hence, under the exponential model, a node only needs to remember the last time it met its neighbour and the 
number of times it has met that neighbour. With these two pieces of information, it can update its estimation of 
the meeting rate (A") from the previously estimated value (A”“^) using the following equation. 


A" = 


n — 1 + A' 


n—lr, 


( 6 ) 


Based on this argument, we develop a more practical version of MinLat in which an arbitrary pair of nodes i and 
j use their estimated meeting rates Xij in their calculations and modify this estimation each time they meet. We 
refer to this version of MinLat as MinLat-E. 

Let t denote the time since the network began operating, and denote by Bt the decision matrix achieved by 
MinLat-E at time t. Lurther, denote by the estimate at node i at time t of the expected latency to the 

destination, when the forwarding decision matrix is B*. This estimate differs from that obtained in Algorithm 2, 
Lid{i), because the distributed algorithm calculates them using estimated meeting rates Xij. The following theorem 
states that the achieved expected latencies, and the estimated expected latencies, Ljrf^f(Bt,z), converge in 

probability to the optimum expected latencies Ljrf(B*). The proof is provided in Appendix]^ 

Theorem 4. For any node in the network, the sequences of estimated and achieved latencies converge to its optimum 
expected latency in probability, i.e., ^ Lid{^*) and {Lid(Bt)} A Lid(B*). More precisely for any 

e > 0, 

lim P{\L,d,t{^t, i) - Lid{^*)\ < e) = 1 (7) 

r—>oo 


lim P{\Ud{^t) - Ud{^*)\ <e) = l 

t—>oo 


( 8 ) 


We check the claims of Theorem and investigate the convergence speed of MinLat-E through simulations in 
Section |I3 


IV. Simulation Results 

In this section, we investigate the efficiency of our proposed approach in modeling and solving the forward¬ 
ing/routing problem in different opportunistic network scenarios. We hrst test our algorithms using three different 
networks to model the contacts between = 41 mobile nodes. The characteristics of the networks are derived 


from the InfocomOS dataset |40|. This data set is based on an experiment conducted during the IEEE Infocom 2005 
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conference in Miami where 41 Bluetooth enabled devices (Intel iMote) were carried hy attendees for 3 days. The 
start and end times of each contact between participants were recorded. The average time between node contacts in 
the InfocomOS dataset is 1.3 x 10“^ seconds (3.7 hours). In our processing, we only consider the contacts in which 
both devices recognized each other so that an acknowledged message could be transfered between them. 

In the first network, {Net I), we construct a contact graph using an evolving undirected network model based on 
the preferential attachment mechanism. We start with a small fully connected graph of mo = 5 vertices and add 
vertices to it one by one until the graph consists of iV = 41 nodes. At each step, the new vertex is connected to 
m = 5 previously existing vertices. The probability that the new vertex is connected to vertex i is where 

ki is the degree of i up to this stage. After building the contact graph, we assign a parameter Xij to each pair of 
nodes i and j which are connected in the contact graph and assume that they meet with exponentially distributed 
intermeeting times with parameter Xij. We choose the parameters Xij from a uniform distribution with the same 
expectation as the average of node meeting rates observed in the InfocomOS dataset. 

In Net II, we set Xij to be equal to the inverse of the average intermeeting time between nodes i and j in 
the InfocomOS dataset. We are interested in the behaviour of the algorithms in relatively sparse networks, so we 


limit the number of neighbours of each node: node i is only connected to node j in the contact graph if the 
meeting rate Xij is among the largest iT = 10 meeting rates of either node i or node j. In our simulations, the 
meeting times between nodes i and j for Net II are then chosen from an exponential distribution with parameter 
Xij. In the third experimental network. Net III, we use the actual meeting times recorded in the InfocomOS dataset. 
The analysis in | [37| indicates that the distribution of individual intermeeting times for most pairs of nodes can 
be approximated reasonably well by an exponential distribution; on the other hand, the aggregate distribution of 


contact times shows heavy-tailed behaviour and is better approximated using a truncated power distribution |34|, 
|. Table 1^ summarizes the properties of the test networks. 


TABLE I: Test Network Properties 


Net¬ 

work 

Contact 

Graph 

Parameters 

Intermeeting 

Times 

Number 

of nodes 

/ 

Preferential 

Attachment 

Too = 5 
m = 5 

Exponential 

A: uniform 

41 

II 

Sparsified 

Infocom 

A = 10 

Exponential 

A: data-set 

41 

III 

Sparsified 

Infocom 

A = 10 

Data-set times 

41 


As mentioned in Section we call our proposed decentralized greedy latency minimization algorithm MinLat 
and refer to its more practical version with meeting rate estimations as MinLat-E. In these two algorithms, the 
decisions that nodes make for future forwarding rules depend on the (estimated) meeting rates, which are derived 
from the frequency of past contacts between nodes. Thus, MinLat and MinLat-E can be identified as history-based 
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routing algorithms. In order to evaluate their performance, we compare them with existing history-hased routing 
protocols that can he implemented in a distributed fashion and do not need a-priori knowledge of the network 
topology. As mentioned in Section]^ the fixed point algorithm proposed in \24\ is proved to result in the minimum 
expected latency (which is expected to he the same as the result of our proposed centralized Algorithm [^. However, 


the proposed algorithm in |24| is centralized and needs to he performed in a control unit where the whole topology 


of the network is known. The TOUR algorithm proposed in |25| is decentralized, hut each node needs perfect 
a-priori knowledge of its meeting rates with other nodes. Also, the main focus of TOUR is to find fhe opfimum 


way fo make forwarding decisions based on fhe priorifies of messages. We have chosen PRoPHETv2 |32| and 
MaxProp as fhe mosf appropriafe candidafes for comparison. We also compare fo Epidemic roufing fTOl , 
which is expecfed fo resulf in a high delivery probabilify af fhe expense of high usage of nefwork resources. The 


paramefers of PRoPHETv2 are sef fo fhose suggesfed in |14| and |^, i.e., Pinit = 0.75, (3 = 0.25, 7 = 0.98, 
and time sfep = 1. In order fo pul fhe focus on evalualion of fhe performance efficiency of forwarding rules and 
eliminafe fhe effecl of fhe buffer managemenf technique on Ihis performance, we firsl fesf fhe algorilhms on ideal 
nefwork scenarios where fhe message life limes, buffer sizes and dala exchanges have no reslriclion. Therefore for 


Ihese simulalions, fhe dropping rules proposed for MaxProp in |15| are nol applied. We Ihen sludy fhe nefwork 
behaviour when Ihese praclical challenges are added fo fhe simulation selups. 

We divide fhe InfocomOS dalasel info slols of 12 hours. In each of Ihese lime slols, we build nelworks 1 fo 111 
using fhe nodes lhal are presenl in lhal period. The inlermeeling lime exponential paramefers (Ajjs) are estimated 
based on fhe meelings lhal occurred in lhal specific time slol and nelworks I and 11 are conslrucled using Ihese 
estimated parameters. Eor each of fhe firsl four 12 -hour periods (fhe firsl Iwo days of fhe conference), we send 
1000 messages, spaced by 5-second intervals, from randomly chosen source nodes fo a parlicular deslinalion. We 
lerminale fhe simulation al fhe end of fhe 3-day period, and calculale fhe fraction of messages successfully delivered 
by each of fhe single copy (PRoPHETv2, MinEal) and mulli-copy (Epidemic, MaxProp) forwarding algorilhms. 
Eor each algorifhm, we also calculale fhe average lalency of fhe messages lhal are delivered by all four algorilhms; 
fhe average number of hops lhal messages pass fo reach fhe deslinalion; and fhe average buffer occupancy of fhe 
nodes. Eor each of fhe 41 nodes of fhe nefwork, we calculate fhe average of performance melrics over fhe lime slols 
when lhal node has been chosen as a deslinalion. Eigure [T] shows fhe average and fhe 95% confidence inlervals 
of fhe four performance melrics for differenl desfinalions in fhe Ihree lesl nelworks using Epidemic, PRoPHETv2, 
MaxProp, and MinEal forwarding algorilhms. There is no reslriclion on message life time or buffer size lhal can 
cause message dropping in Ihese simulations. However, Ihe delivery rates in some cases are less lhan 1 because Ihe 
simulations are terminated before all of Ihe generated messages are successfully delivered. We observe lhal in all 
Ihe Ihree lesl nelworks, MinEal has a belter performance lhan Ihe olher existing hislory-base single copy algorilhm. 
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Fig. 1: Comparison Metrics in Test Networks with Af = 41 


PRoPHETv2, in terms of delivery rate and average latency. Its performance is also comparable to MaxProp which 
is a history-based multi-copy algorithm. Also, noting that the scale of the vertical axis of Figure [T^ is logarithmic, 
we see that MinLat occupies much less memory of nodes’ buffers in average. For networks 1 and 11, where the 
assumption of exponentially distributed intermeeting times holds, delivering a higher rate of messages with lower 
average latencies than PRoPHETv2 is expected from Theorem However, we observe that this result also holds 
for network III where the actual meeting times are used. All algorithms display slightly poorer performance in 
network ///; this is probably due to heavy-tailed and non-stationary intermeeting times. 

In order to explore how the incorporation of meeting rate estimations in MinLat-E affects the message delivery 
performance, we conduct further simulations with a different message generation scenario on network II. Figure 
1^ displays the average delivery latency as a function of time for the history-based routing algorithms PRoPHET, 
MaxProp, MinEat, and MinEat-E. The delivery latency values are averaged over 32 simulation runs while the 
destination node and message generation times are fixed in all the runs. Messages are generated with interarrival 
time of t seconds at randomly chosen source nodes where t is uniformly distributed in [0,5]. Each point on the curves 
represents the average latency of the 1500 most recently sent messages. As Eigure|^ shows, in all of the three single 
copy routing algorithms (PRoPHET, MinEat, MinEat-E), the average time it takes for a message to be delivered 
at the destination decreases as time goes by. This decreasing trend is due to the fact that the forwarding rules 
discovered by nodes improve as the nodes have more contacts and their information concerning their neighours’ 
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message delivery capabilities becomes more accurate. However, in the multi-copy routing algorithm, MaxProp, 
the average message delivery time increases in the beginning. In MaxProp, the weights assigned to the links are 
initialized to be equal, which means that nodes forward messages to more of their neighbours. There is thus a 
high level of message replication which leads to messages reaching the destination sooner on average. As time 
passes, the level of replication decreases and the average delivery time increases. We also observe that MinLat-E 
and MinLat eventually achieve the same average delivery latencies, as expected from Theorem However, the 
convergence to the optimum point is slower in MinLat-E due to the time it takes for nodes to obtain accurate 
estimates of their meeting rates. 

3.5 
3 

2.5 
2 

1.5 
1 

0 0.5 1 1.5 2 2.5 

xIO^ 

Eig. 2: Evolution of average delivery latency with time (both in seconds). 


xIO^ 



We examine the performance of the forwarding algorithms in larger networks by extending network I to 100 nodes 
but with the same average A parameter for exponential intermeeting times. We also make our model more realistic 
by adding some practical restrictions to the network model. Eirst, we assume that messages have finite TTL, i.e., 
a message is discarded when its lifetime exceeds a certain threshold. Eigure displays the performance of routing 
algorithms for different values of TTL varying from 0 to as large as the simulation time (almost 2.5 x lO^seconds). 
Simulations are run 100 times and in each round, a different destination is randomly chosen from network nodes 
based on a uniform distribution. The average latency and average hop count are calculated only for the messages 
that reach the destination. 


The simulation results in Eigure 3b show that decreasing the TTL has a similar overall effect on all of the 
algorithms. Eor larger TTLs, the delivery rate increases, but the buffer occupancy, average latency and hop count 
also increase. MinLat outperforms PRoPHET and MaxProp in terms of delivery rate, average latency, and buffer 
occupancy even in a scenario with a restricted message life time. Although intermeeting times are exponentially 
distributed and the contact graph is based on preferential attachment, in this larger network of 100 nodes, PRoPHET 
cannot reach 100 percent delivery rate even without any restriction on TTL. 


The next step towards a more realistic network model is to consider limits on buffer size. In the next set of 
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Fig. 3: Effect of TTL 



(b) Average Delivery Rate 



(d) Average Buffer Occupancy 

I Performance Metrics 


simulations, we assume that TTL is 10^ seconds so that all algorithms reach their best possible delivery rate. We 
also assume that each node has a limited capacity for keeping the messages. When the buffer occupancy of a node 
reaches its limit, messages from other nodes are not forwarded to it. Moreover, any generated messages at the fully 
occupied node are immediately dropped. Figure shows the performance of algorithms for buffer sizes in the range 
of 0 to 1250 messages. We see that increasing the buffer size improves the delivery rate for all algorithms. It also 
reduces the average latency because as the buffer size increases, nodes can more frequently follow their optimum 
forwarding rules. 

Finally, we assume that the contact duration is limited so that the number of messages during any meeting is 
restricted by an exchange limit. We set the buffer size to 1000 messages so that all the algorithms reach their best 
possible delivery rate. This buffer size implies 50 MB of node memory if each message is 50 KB. We check the 
effect of varying the exchange limit on the network performance. Figure shows the four comparison metrics for 
exchange limits in the interval 0 to 500 messages. As we see in the figure, MinLat cannot achieve the optimum 
average latency for some values of exchange limit, but it still has the best performance in terms of buffer occupancy. 


The simulation results displayed in Figures [3]|^ illustrate that although MinLat has been designed for an ideal 
network model, it has an acceptable performance when we impose realistic conditions such as finite TTL, buffer size, 
and exchange limit. Our final set of simulations explores the impact of including the recursive maximum likelihood 
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Fig. 4: Effect of Buffer Size on Performance Metrics 
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Fig. 5: Effect of Exchange 



(b) Average Delivery Rate 
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estimation of meeting rates. We conduct experiments using both the centralized and decentralized algorithms 
operating on the extension of network I to 100 nodes. We select the A parameters of intermeeting times from 
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a uniform distribution i7[0, 0.01]. The destination node is randomly chosen from the N nodes of the network based 
on a uniform distribution and is fixed throughout the simulation. We run the simulation for 5 x 10^ seconds. We 
examine the error terms in Theorem 4, averaging over all nodes. Figure shows the average absolute difference 
between the estimated expected latencies and the true minimum latencies, i.e., “ ^A:d(B*)|. 

Figure shows the difference between the achieved average latencies and the true minimum latencies, i.e., 

'k Sfc — Lfcrf(B*)|. Figure [^indicates that the decentralized algorithm achieves almost the same estimation 

error as the centralized algorithm, suggesting that the limiting effect is the convergence of the meeting rate estimates 
rather than the dissemination of latency estimates through the network. As expected from Theorem we see that 
the estimated and achieved errors both decay to zero as the time goes by (for achieved latencies, it is almost zero 
for all nodes of the network after t = 7000 seconds). 
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(a) Absolute Estimated Errors (b) Absolute Achieved Errors 


Fig. 6: Absolute Errors of Estimated and Achieved Eatencies. The central mark of each box is the median, the 
edges of the box are the 25th and 75th percentiles, the whiskers extend to the highest values not considered as 
outliers, and outliers are plotted individually. 


Comparing Eigures and shows that for both centralized and decentralized scenarios, the average difference 
between the achieved latency and the minimum latency is less than the average difference between the estimated 
latency and the minimum latency. This is expected, because the estimated latencies are based on incorrect decision 
matrices and estimated meeting rates, whereas the achieved latencies are derived from the actual meeting rates. The 
results suggest that even when there remains substantial inaccuracy in the expected latency estimates (e.g. at time 
t = 3000 seconds), the algorithm identifies a close-fo-opfimal forwarding decision mafrix. 


V. Conclusion 

In this paper, we used an analytical framework to model the opportunistic data transfer among mobile devices 
in Delay Tolerant Networks. In our model, the random intermeeting times of nodes are assumed to be independent 
and exponentially distributed. We formulated the routing/forwarding problem as an optimization problem in which 
the goal is to minimize the sum of expected latencies from all nodes of the network to a particular destination. 
We proved that the solution of this problem is binary, i.e., when an arbitrary node meets any other node in the 
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network, its optimum forwarding rule dictates either to always forward its messages to the encountered node or to 
never forward any messages to it. We also showed that the solution of this optimization problem minimizes the 
expected latency from each node of the network to the destination as well. Based on these results, we proposed 
centralized and distributed versions of an algorithm to find the optimum forwarding decision rules and proved that 
each of these algorithms result in the same solution. In order to evaluate the performance efficiency of fhe suggesfed 
algorifhms in differenf synfhefic and real-world nefworks, we chose four performance mefrics as comparison mefrics 
and compared our proposed decenfralized algorifhm (MinLaf) wifh fhe mosf similar existing approaches. In order 
fo evaluate fhe performance of MinLaf in more realisfic scenarios, we conducfed simulafions in larger nefworks 
wifh pracfical consfrainfs like limifed message life (TTL), buffer size and message exchange. 

One of fhe main confribufions of Ibis work is relaxing fhe condifion of having complete knowledge of meeting rates 
af each node for making fhe forwarding decisions. We used a recursive maximum likelihood procedure (MinLaf-E) 
fo learn fhe meefing rales online and proved ils convergence in probabilily lo fhe same optimal solulion. The validily 
of Ihis fheorelical resull was assessed Ihrough simulafions. Moreover, we compared fhe convergence speed of fhe 
proposed cenlralized and decenfralized algorifhms when fhe meefing rales are eslimaled online. The simulalion 
resulls show lhal fhe decenfralized algorifhm has almosl fhe same convergence rale as fhe cenlralized algorifhm, 
even Ihough fhe nelwork topology is nol known al individual nodes. 

In fulure work, we aim lo explore fhe effecl of lime-varying meefing rales belween nodes, which would molivale 
fhe use of filters fo Irack fhe rales. We also hope lo examine whelher if is possible to derive similar expressions 
for expecfed lalencies and oplimal forwarding rules for cases when fhe inlermeeling times are nol exponentially 
disfribuled or are correlaled. 


Appendix A 
Proof of Lemma[T] 

When node i commences in its routing of a packet, it must first wait a time before it meets one of its neighbors. 
The amount of time before node i meets a specific neighbor j is an exponenlially disfribuled random variable wifh 
parameler Xij. The lime is equal to fhe minimum of fhe exponenlially disfribuled random variables corresponding 
lo all neighbours k £ Si and ils expected value is 


E{T^) = 


1 


Y1 ^ik 

k&S, 


The probabilily lhal j is fhe firsl node lhal i meels is \ ^ is 


Lid{P) = E{T^) + J] + {1 - pij)Lid{P)] 


jes. 


kes. 


(9) 


( 10 ) 
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The last term in ( [T0| ) follows from the memoryless property of the distributions. Suhsituting Q into ( fTO] ) leads to 

Q. □ 


Appendix B 
Proof of Theorem[T] 

Let us assume that the nodes, excluding d, are labelled in ascending order of their expected latency under P*, i.e., 
Lid(P*) < L 2 d(P*) < ■ < L 7 v-id(P*)- For a given matrix P, we denote by P^ all rows of P except i. If we 

fix Pi and Lkd for k £ Si,k ^ j for some j £ Si, then Lid is monotonically increasing with respect to Ljd (see 
Q)- This implies that if we commence with any P and change only pj to decrease Ljd, then all other Lid such 
that j £ Si either decrease or remain the same. The matrix P* must therefore satisfy p* = argminL,rf(P5,pj) 
for all j. Otherwise we could choose an alternative p' that reduces Ljd and hence achieves U{^' 

We can examine the partial derivative of Lid with respect to pij at P' = (P|, pj): 

dLid _ ^ikPik{Ljd(P ) ~ Lkd(P )) ~ 1] 

dPij ^ikPik)"^ 

This derivative has the same sign as: Ljd(P') — equivalently Ljd(P') — Lid(P'). This 

expression for the derivative, together with the requirement that p* = argminLjrf(P|,pj), implies that pT = 0 if 
Lid(P*) < Ljd{P*) and pT = 1 if Lid(P*) > Ljd(P*). Our assumption that the solution is unique implies that 
Lid(P*) 7 ^ Ljd(P*). Otherwise, from ( [T 0 | ), it is clear that we could choose any p*j between 0 and 1 and achieve 
the same Lid(P*), without affecting any other Ljd(P*). This establishes statement (1) of the theorem. 

Although we have established that Ljd(P|,p*) = minLjd(P|,pj), we have not yet shown that P* globally 
minimizes Lid- We establish this by contradiction. Suppose P* does not minimize the expected latency for some 
non-empty set of nodes M' C J\f. In other words, denoting the minimum expected latency achieved via the 
minimization in Q for node i by L*^, we have 

ViGAA': L*^<Lid{P*) ( 12 ) 

Let node s be the node in M' such that L*^ < for all k £ M', k ^ s. Denote by i the ranking of the 
node with greatest expected latency under P* such that Lid(P*) < L*^. Based on the discussion above, for each 
node i £ p*^ = 0 for all k > I and hence p*^ = 0. Node s must have at least one neighbour in 

the set {d, 1, 2,... £}. Otherwise, it could not achieve an expected latency under P* that is less than all nodes 
+ 1,... ,N — 1 (observe from ([T]l that Lsrf(P*) > minp^^>o LjdiP*)). 

The matrix P' that achieves the minimum must satisfy p'^^ = 0 for all k £ M', since for any matrix P' we 

have Lkd(P') > L\^ > L*^. We also have p'^j^ = 1 for A: G 5s n {d, 1,2,... ,1} if Lkd(P') < LsdiP')- For a fixed 
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choice of p'^ the value Lsd(P') decreases if we can reduce for any k such that = 1. The matrix P* 

minimizes for all A; G {1, 2,, £}, implying that p'^ = p|, for all /c G {1, 2,..., £}. Since Lfcrf(P') = ^^^(P*) 
for all k £ SgC] {d, 1,2,, £}, it follows that p^ = p*. For node s, the values of p' for j ^ {1, 2,..., i, s} have 
no impact on Lgd, so we have Ls(i(P*) = ^^^(P') = L*^. This contradicts the original assumption that P* does 
not minimize the latency for all nodes s G M', and thus establishes statement (2) of the theorem. □ 

Appendix C 
Proof of Theorem[2] 

We observe that for all i G M, Gtd > L*^ (since the optimizations are the same). Based on Theorem 1 and its 
proof, the equality holds only if j G ^ for all j G Si such that L*^ < L*^. The statements in the theorem follow 
based on an induction argument. 

Suppose, without loss of generality, that the nodes are labelled in ascending order of expected latency under 
B*. For node 1, the only neighbour with lower expected latency is the destination. In iteration 1, the destination 
is included in A and must be in 5i. Recall that > min;,* =i L*^. Node 1 has the minimum expected latency 
according to the chosen labelling and Theorem 1, except for the destination itself. The relationship thus implies 
that d £ Si. We therefore have Gid = < L*^ < Gjd, and node 1 is selected to be added to A, with bid = 1 

and bjd = 0 for all j A d. Statements la)-c) in the theorem clearly hold after one iteration, i.e., after the addition 
of node 1 to A. 

Assume the same statements hold after the addition of node k — 1 to A. Then, for node k we must have j £ A 
for all j £ Sk such that L*^ < Again this implies that Gkd = L*kd ^ ^*jd — 2 > k. Thus, node k 

is correctly selected for addition to A and the statements la)-c) hold at the end of iteration k. 

It follows that the statements hold for all iterations of the algorithm, and after completion, when A = M, the 
second statement follows. □ 


Appendix D 

Proof of Theorem[3]and Proposition[I] 

A. Proof of Theorem 

Assume that the nodes are labelled in order of ascending expected latency under B*, i.e., Lid{W) < L2d(B*) < 
• • • < Ljv-id(B*)- Denote by Ti the moment of time at which node 1 meets the destination node. For k = 2,..., N 
denote by the earliest time by which node k has met all nodes in the set {1,..., A: — 1} n 5^ in the time period 
(Tfc-ijTfc]. Due to the assumption that the inter-meeting times are exponentially distributed, T/v is finite with 
probability 1. 
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At Ti, node 1 learns its meeting rate with the destination (Ai^). Since the estimated latencies are initialized to 
cx) and due to the update equations in Algorithm the estimation that node 1 has at Ti of the latencies of its 
neighbors i G are upper-hounds, i.e. Lid(l) > Lj(i(B*). As discussed in the proof of the previous theorems, the 
minimizer bj has hid = 1 and bij = 0 for all j / d. At time Ti, since the term involving d in the update equation of 
Algorithm 1^ has its minimum value, the vector m| = bj identifies the same minimum latency Lid{l) = Lirf(B*). 
Hence, immediately after time Ti we are guaranteed that bi = b^. 

At Tfc, node k is aware of the minimum expected latencies Lsd{k) = Lsd(B*) for the nodes in the set Vk = 
{d, 1, k}r\Sk- All other expected latencies are upper hounds, i.e. Ljd{k) > Ljd(B*) for j ^ Vk- The solution b^ 
takes value 1 only for nodes in Vk- The minimizer at time Tk is thus equal to b^ and achieves Lkd{k) = Lfcrf(B*). 
Therefore, imediately after Tk we will have b^ = b|,. This argument applies until just after T^r, at which point we 
have B = B*. Since Tjv is finite with prohahility 1, the statement of the theorem follows. 


B. Proof of Proposition 

Algorithm has converged when all of the nodes have met all of their relay candidates and have identified fheir 
opfimum forwarding rules. Thus, E{Tn), where Tj\f defined above in fhe proof of Theorem is fhe expected 
convergence time. Considering the worst case where an arbitrary node fc > 1 is connected to all the nodes in the 
set {1,..., k — 1}, we have 

E{Tk - Tk-i) = E{ max Xi) (13) 


where Xi denotes the intermeeting time of node k with its neighbour i and follows an exponential distribution 
with parameter Xki- Using a standard result for the expected value of the maximum of non-identical independent 
exponential random variables, we have 


k-l 


k—1 k—1 


E{T, - r,_,) = E - E E 


1 


i=l 


2=1 ^=2 + 1 


Afci + A, 


kj 


k—1 k—1 k—1 

+ EE E 

i=l j=i+l i=j+l 


1 


Afci + Xkj + Xki 


(14) 


An upper bound on this value is E{Tk — Tk-i) < < 


fc-i 

minAfci 


time of Algorithm 


is E{T^) < 


1-1 


2 min Aj 

^li>0 


Therefore, an upperbound on the convergence 


□ 


Appendix E 
Proof of Lemma[2] 


Let’s return to considering the probability decision variable vector p instead of binary decision variable vector. (We 
still expect the optimal solution to be of the binary form). Without loss of generality, we relabel the nodes k £ Si 
by labels 1,..., |5j|. The optimization problem that node i tries to solve in Algorithm follows this general form 
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which is known as linear fractional optimization problems: 


Lid{i) = min 


+ a 


p‘ d^p® + j3 

subject to Ap® < b 


(15) 


where = [Pn, ■■■,Pi\s,\V^ a = 1, C|5^|xi = [XiiLid{i),Xi\s,\L\s,\d{i)V, /? = 0, d|5^|xi = [A^, Ai| 5 ^|]^, 

t> 2 | 5 i|xi = [1) 1)0) ...,0]^,and the elements of the matrix A 2 | 5 .|x| 5 i| are 


Aij — < 


1 if z < |5i| & j = i 

—1 if i > |5i| & j = i — \Si\ 

0 otherwise 

After applying the following parameter changes (the Charnes-Cooper transformation), we have 


1 


1 


X = 


d^p* 


P 


y = 


d^p* 


(16) 


and the optimization problem [T^ converts to 


mm 


c'^x + ay 


(17) 


subject to d^x = 1 
Ax < by 
y>o 

which is a linear optimization problem and can be solved using Linear Programming (LP) solution methods. □ 

Appendix F 
Proof of TheoremH] 

We first prove the statement of the theorem for estimated latencies. Without loss of generality, we relabel the nodes 
such that Lirf(B*) < L 2 d(B*) < ... < LjYd(B*). At node i, MinLat-E forms estimates of the meeting rates with 
the contact graph neighbours, Aik for k G Si, and the expected latencies from each neighbour Lkd{Bt,i). The 
result is a sequence of random variables {Lid{Bt,i)}, with a variable being added to the sequence each time node 


i meets another node. |41| shows that the maximum likelihood estimator of the meeting rates is consistent, i.e.. 


Vz, j G M, Aij A Aij. More precisely. 


V Cij ^ 0 . lim P (I Aij Aij I Cij ) — 1 


t—>oo 


(18) 
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Equivalently, writing = eoXij, we have for any <5 > 0 and eo > 0, there exists a fo > 0 such that for all t > fo- 

-P((l ~ ^o)Xij < Xij^t < (1 + > (1 ~ (19) 


where denotes the estimate of the meeting rate between nodes i and j at time t. 

We show that for any set of meeting rates {Xij}ij^, there exists an e = eo for which the optimum forwarding 
decision matrix in MinLat-E (B*) is the same as the optimum forwarding decision matrix in MinLat (B*) with 
desirably high probability. In order to do so, we find upper and lower bounds (that apply with high probability) 
on the estimated expected latencies for the optimal decision matrices identified by both MinLat and MinLat-E. 
We first demonstrate a relationship between the estimated and true expected latencies that would hold if the nodes 
employed the optimal decision matrix B*. We show that there exists a ts^i such that for each node i G M, with 
probability greater than 1 — 5, for any positive 5, we have for all t > ts/. 

< il±^^L,,(B*) (20) 


We derive ( |20l ) by induction. Erom the arguments made in Theorem 3, we know that under B* an arbitrary node i 
will not forward any messages to a node that does not belong to the set {1, ..,i — l,d}. Eor i = 1, after time ts,i 
such that ( [T^ holds, we have, with probability greater than 1 — <5 for t > tsx- 

“ TA-A^ < l^id,t[B , 1) - ^ 1 ^ - X - (^^1 

i + eo i + eo Md Xid,t Md J- — eo 

Suppose ( [20| ) holds for all the nodes 1,... ,k — 1. Denote by > t the last meeting between node j and k that 
occurs subsequent to ts^k-i but prior to a considered time t. Then we can identify a so that the following 
relationship holds with probability greater than 1 — <5 for t > 


LM,t{B*,k) = 


<( 


1 


^jeSk 
1 


{l + Jl,^SkbljXkjLjdXkiB*,j)) 


(1 + eo) 


k-2 


< 


1 ^o'^jeSk^kj^kj 

(l + eo)"^-' 




(1-eo)^ 


-LkdiB*) 


(22a) 

(22b) 

(22c) 


Here we have chosen to be sufficiently large such that the inequality on the second line holds with probability 
exceeding 1 — 5. Similarly we can identify a so that the lower bound in (^l holds with probability greater than 


1 — (5 for t > tik- By taking ts^k = we see that ( [20| ) holds for node k as well, and by induction, 

holds for all nodes i G M. 

We now turn our attention to the forwarding matrix determined by MinLat-E (B^). We first consider a scenario 
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where the estimates of the meeting rates are frozen after a certain time tf. The minimization procedure in MinLat-E 
is the same as in MinLat, hut operates on the estimates of the meeting rates. If these estimates are held constant, then 
the results in Theorems 1-3 apply, with the suhstitution of anywhere we make use of Ajj. With prohahility 1, the 
optimization algorithm will thus converge after a finite time t', and the estimated expected latencies will he consistent 
across the network. Hence, there exists a labeling {l,2,...iV} for which , ^ ) 2) < • • • < 

, N) We can now employ the same argument that was used above for B* to determine that there is a 
finite time ^ such that the following bound holds for all G Af with probability greater than 1 — (5 for all f > 




hdi^t) 


(23) 


(l + go)fc ^ (l-6o)^ 

In the actual MinLat-E algorithm, the meeting rates A are not frozen after to, but continue to be updated as more 
meetings occur. This only results in the probabilistic bounds on A being tighter, and hence can only tighten the 
bounds on 

To avoid having node-specific bounds on the accuracy of the estimates, we can rewrite the bounds as: 


< LUB, k) < (24) 

This bound holds for both B = B* and B = B^ with probability at least 1 — (Jq after some time to. 

Our goal is to show that there exists a moment of time after which Bt = B* is true with desirably high 
probability. We can accomplish this by showing that there exists an eo (and thus an associated time to) for which 
the upper-bound on is less than the lower-bound on Li^CBtji) for any B^ / B* for all t > to. If this 

is the case, then with probability exceeding 1 — (fo> the optimization procedure that derives B^ will set it to B*, 
because it minimizes the estimated latencies. Hence, eo should satisfy 




(25) 


which leads to 

In(JC) 

g2JV-l _ 1 

^0 ^ In(JC) 

g2JV-l -)- 1 


(26) 


where K = . 

Now that we have established that after a finite amount of time Bt = B* occurs with a probability desirably 
close to one, we can show that {Lid{Bt,i)} A Lid(B*) for any i £ N using the following properties: 

1) If Xn X, then g{Xn) 9 {X) (Continuous Mapping Theorem); 

2) If Xn 4 X, then Xn 4 X; 

3) If Xn-^cG M, then Xn A 
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4) If Xn A X and Yn c gM., then g{Xn, Yn) A g{X, c) (Slutsky’s Theorem), 
where A denotes the convergence in distribution and 5 : M x M —M is an arbitrary continuous function. 

Again consider the node labelling such that < L 2 d(B*) < • • • < For node 1, {Lid(Bt, 1) = 

J-} A Li(i(B*) = ^ which is obvious from property 1. For any node A: > 1, if {Ljd(Bt,j)} A Ljd(B*) holds 

•^Id 

for any — 1}, then due to properties 2-4 we have, 


Vj G {1,...,A:-I,d}n5fc : 

{XkjLjd(Bt, k)} —)■ XkjLjd(B*) G M , 


(27) 


and 


{S 




(28) 


Property 1 in combination with ( |27| ) and ( |28| ) results in the statement of the theorem. 

For the achieved expected latencies, Lkd(Bt), as opposed to those estimated at the nodes, the proof is more 
straightforward. For a given decision matrix, B*, the expected latencies Lkd(Bt)^ are functions of the true meeting 
rates A and are thus not random variables. Thus, the sequence {Lid(Bt)} is only a function of the random sequences 
{Ajj}, j G Si via the optimization that determines B^. Since we have established that B^ converges in probability 
to B*, it follows that {Lid{^t)} converges in probability to Ljd(B*) due to property 1. 

□ 
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